Relieving Polysemy Problem for Synonymy Detection
نویسندگان
چکیده
In order to automatically identify noun synonyms, we propose a new idea which opposes classical polysemous representations of words to monosemous representations based on the “one sense per discourse” hypothesis. For that purpose, we apply the attributional similarity paradigm on two levels: corpus and document. We evaluate our methodology on well-known standard multiple choice synonymy question tests and evidence that it steadily outperforms the baseline.
منابع مشابه
Clustering tweets usingWikipedia concepts
Two challenging issues are notable in tweet clustering. Firstly, the sparse data problem is serious since no tweet can be longer than 140 characters. Secondly, synonymy and polysemy are rather common because users intend to present a unique meaning with a great number of manners in tweets. Enlightened by the recent research which indicates Wikipedia is promising in representing text, we exploit...
متن کاملA Frame-based Approach to Polysemous Near-synonymy: the Case with Mandarin Verbs of Expression
In this paper, we propose a frame-based approach to polysemy by analyzing three near-synonymous verbs biaoshi (表示), biaoda (表達) and biaolu (表露). Based on Liu and Wu (2004), this paper further discusses the cross-frame phenomena of near-synonyms with a detailed comparison of their syntactic and collocational patterns. It is shown that polysemy among related verbs may be well defined and manifest...
متن کاملRelative Synonymy and Conceptual Vectors
Synonymy is a pivot relation in NLP but remains problematic. Putting forward, we introduce the notion of relative synonymy, to circumvent some diÆculties among which possible polysemy and contextual interpretation. In the framework of conceptual vectors, it is then possible to formalize test functions for synonymy and to experiment their use in thematic analysis that will help text classi cation.
متن کاملConcept Extraction and Synonymy Management for Biomedical Information Retrieval
This paper reports on work done for the Genomics Track at TREC 2004 by ConverSpeech LLC in conjunction with scientists at the Saccharomyces Genome Database (SGD), the model organism database located at Stanford University, California. The rapidly increasing number of articles in the biomedical literature has created new urgency for software tools that find information relevant to specific infor...
متن کاملAn approach to classify synonyms in a dictionary of verbs
Several works in computational linguistics try to study the relationships among dictionary entries. These works consider the dictionary as a graph where words are represented by vertices and relationships between two words by an arrow between the corresponding vertices. Several kinds of relationships exist between two words such as synonymy, antonymy, hyperonymy, hyponymy ... We define the prox...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009